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HETEROLOGOUS PROTEIN PRODUCTION SYSTEM 
USING AVIAN CELLS 

BACKGROUND OF THE INVENTION 
5 1. Field of the Invention 

The present invention relates to novel expression systems that 
can produce biomedically important heterologous proteins Including 
human erythropoietin (hereatter "EPO"), and more specifically to the 
production of various heterologous proteins by transfecting DNA 
10 encoding the proteins, such as the genomic DNA encoding EPO into 
avian cells. 

2. Related Arts 

Many recombinant proteins used in medicine are relatively small 
and simple in their structure, and biologically functional proteins can be 

15 produced in prokaryote such as E. coli. However, some human 
proteins of medical interest, such as TPA (tissue plasminogen 
activator), Factor VIII, EPO, etc. are more complicated because 
biological function requires post-translational modification. For 
example, EPO is extensively glycosylated with the carbohydrate portion 

20 accounting for 40 % of the molecular mass. It has been shown that 
the carbohydrate portion of EPO is important for biological function. 
Accordingly, EPO produced in E. coli, yeast or insect is inactive or very 
weakly active in vivo, while EPO produced in COS or CHO cells was 
found to be fully active. Accordingly, those kinds of heterologous 

25 proteins have been produced only in mammalian cells. 

In the meantime, the avian system has been used for the study 
of gene expression in higher eukaryote for a long time. One of the first 
viruses to be linked to tumors was the Rous sarcoma virus of chicken, 
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and this virus was instrumental in demonstrating that the retroviral 
oncogene can originate from a cellular gene, leading to the concept of 
the protooncogen. Studies of gene expression have also been done 
using the RSV LTR promoter, which has often be used for high level 
expression of heterologous genes in mammalian cells. In addition, 
avian embryo cells have been used extensively in studies of various 
animal viruses. 

SUMMARY OF THE INVENTION 

The present invention is a research for the high level expression 
of eukaryotic heterologous proteins. It is an object of the present 
invention to provide a novel heterologous gene expression system 
which can produce proteins of higher eukaryotic cells. It is another 
object to provide the method of efRciently producing higher eukaryotic 
proteins, such as EPO, etc., which has been known to be active only 
when they are produced in a mammalian cell. It is a further object of 
the invention to provide the method of producing, especially, EPO 
among the eukaryotic proteins described above. 

To accomplish the objects of the present invention, the present 
invention provides a heterologous gene expression system comprising 
a DNA encoding a heterologous protein, a vector for receiving the DNA; 
and an avian cell for harboring the vector. 

The present invention also provides a method of producing a 
heterologous protein comprising the steps of culturing the expression 
system of claim 1 in media to express the heterologous gene, and 
purifying the heterologous proteins from the cell and the media. 

Preferably, the heterologous protein of the present invention is 
selected from the group consisting of those proteins that are known to 
be active only when expressed in mammalian cells (such as EPO, TPA, 
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Factor VIII, etc.) and preferably, the vector contains a promoter 
selected from the group consisting of SV early promoter, major 
immediate early promoter of human cytomegalovirus (hereafter "HCMV 
MIEP") and RSV LTR, and preferably, the avian cell is selected from 
5 the group consisting of duck embryo cell (hereafter "DE"), chicken 
embryo fibroblast (hereafter "CEF") and quail fibrosarcoma (hereafter 
"QT'), more preferably QT-VC which was isolated by the inventors. 
QT-VC was deposited to the Intemational Depository Authority, Korea 
Research Institute of Bioscience and Biotechnology Korean Collection 
10 for Type Culture, and assigned a deposit number of KCTC 0277BP on 
August 22, 1996. The deposited QT-VC was transfected with the 
expression vector containing SY-EPO cDNA as described in Fig. 8. 

More preferably, the DNA encoding the heterologous protein is 
genomic DNA or cDNA. 

15 Further, the present invention provides an EPO production 

system comprising a DNA encoding EPO, a vector for receiving the 
DNA, and an avian cell for harboring the vector. 

Moreover, the invention provides a method of producing EPO 
comprising the steps of inserting a DNA encoding EPO into a vector, 
20 transfecting the vector into an avian cell, and culturing the transfected 
avian ceil in media. 

Preferably, the avian cell of the EPO production system is DE or 
QT, and the DNA is a genomic DNA encoding EPO, more preferably, 
the DNA selected from the group consisting of SY, JM, SH and HE 
25 described in Fig. 5. 

Preferably, the vector has a promoter selected from the group 
consisting of SV early promoter, HCMV MIEP and RSV LTR. 

The present invention also provides an avian cell as a host for 
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expressing genes encoding mammalian proteins. 

Further, tlie present invention provides an novel EPO genomic 
sequence selected from the group consisting of SY, M, SH and HE 
described in Fig. 5, and also provides an novel EPO amino acid 
5 sequence selected from the group consisting of JM, SH and HE 
described in Fig. 6. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows the expression of the bacterial CAT gene in avian 
cells. DE and CEF cells were transfected with pRc/CMV containing 

10 (+) or lacking (-) the CAT sequence. CAT activity was measured by 
determining the amount of acetylated chloramphenicol (AC) produced 
from "C-chloramphenicol. The values shown are from one 
representative of more than five Independent assays. For this 
particular experiment, 10 of protein was reacted with '"C- 

15 chloramphenicol for 20 min at 37 "C. 

Fig. 2 shows the comparison of CAT gene expression between 
various cell types and between different promoters. The three 
promoter-CAT fusion constructs were transfected into DE, CEF, CHO- 
K1, and HeLa cells, and CAT activity was measured as described in 
20 Fig. 1. S, SV40 early promoter; C, HCMV MIEP; R, RSV LTR. The 
values shown are from one representative of three independent 
assays. For this particular experiment, 10 |ig of protein was reacted 
with ^*C-chioramphenicol for 30 min at 37 "C . 

Fig. 3 shows the efficiency of DNA transfection in various cells. 
25 pCMV-lacZ constructs was transfected into DE, CHO, Vero, HeLa, 
and 293T cells by calcium phosphate-DNA coprecipitation using the 
conditions used for the experiments shown in Fig. 2. Two days after 
transfection, cells were fixed and stained with X-gat. The number of 
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blue cells per 60 mm tissue culture plate was counted. The total 
number of cells between plates were comparable at 1-3 X 10^ 
Transfection efficiency was calculated relative to DE cells. 

Fig. 4 shows the schematic diagram for cloning of human EPO 
5 and construction of expression vectors. The five blocks represent the 
five coding regions of EPO. The first PGR was performed using 
primers 25 and 33. The amplified DNA fragment was cloned and 
subjected to a second PGR using primers 12 and 9. The wavy tale in 
primer 1 2 contains the nucleotide sequence from the first coding region. 
10 Therefore, the second PGR generates the entire coding sequence of 
EPO so that the first and the second coding regions are attached to 
fomi without intron between them. Primers 12 and 9 contain Hindlll 
linkers at their 5' ends, enabling cloning of the EPO genomic sequence 
into various expression vectors. 

15 Fig. 5 is various EPO genomic DNA sequences. SY, SH, HE 

and JM are the EPO genomic DNA sequences cloned by the present 
invention, and AM and Gl are the EPO genomic sequences which has 
been already reported. Since the intron between the first coding 
region and the second coding region was deleted during the cloning, 

20 the deleted intron is not shown in Fig. 5. 

Fig. 6 is various EPO amino acid sequences. SY, SH, HE and 
JM are the EPO amino acid sequences cloned by the present invention, 
and AM and Gl are the EPO amino acid sequences which have been 
already reported. The abbreviation of the amino acids are as follows: 

25 A: alanine R: arginine N: asparagine D: aspartic acid 

C:cystein Q:glutamine E: glutamic acid H:histidine 

I: isoleucine L; leucine K: lysine M: methionine 
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F: phenylalanine P: proline S: serine 

T: threonine W: tryptophan Y: tyrosine V: valine 

Fig. 7 shows the comparison of CAT gene expression between 
QT-VC and other mammalian cell lines. pCMV-CAT was transfected 
to QT-VC; CH0-K1, and Vero cells, and CAT activity was measured as 
described in Fig. 1 . The transfection efficiency, as measured by X-gal 
staining following cotransfection with pCMV-lacZ, was reproducibly 3- 
5 % in all cases. For this particular experiment, 50 \ig of protein were 
incubated with "C-chloramphenicol for one hour at 37 "C. 

Fig. 8 is the typical structure of a plasmid used to express EPO 
in QT cells. The two types of BamHI cassettes which could express 
the gene for human glutamine synthetase (GS) was made. In these 
BamHI cassettes, the GS cDNA sequence was flanked by the poly A 
sequence from the bovine growth hormone gene and one of the two 
promoters, the partial MMTV LTR (from -220 to +15 from the RNA start 
site) or the 220 bp HSV tk promoter. The BamHI fragment expressing 
GS was inserted into the BamHI site of pCI-neo (Promega, Madison, 
Wl, USA), resulting in a series of pIGA. The Hindlll fragment of the 
SY-EPO cDNA sequence was cloned into the Smal site of pIGA, 
generating the EPO expression vector, pIGA-EPO. 

Fig. 9 shows the production of EPO by QT-N4D4. QT-N4D4 
cells were grown to confluence in a 10 cm culture dish (day 0) in M-199 
containing 10 % FBS and 1 mM MSX. On day 3, the EPO level was 
measured. The cells were then split into 1:3 and seeded onto 10 cm 
dishes. On day 6, the cells were again reached confluence, and the 
medium was replaced with 10 ml flresh medium containing 2 % (•) or 
1 0 % (O) FBS. EPO levels were detennined by ELISA (R&D system, 
Minnesota, USA) 
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Fig. 10 shows the comparison of EPO concentration in DE (•) 
and QT-N4D4 (O) measured by ELISA and by in vitro bioassay. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The inventors have explored the possibility of using avian cells 
5 as a host cell for heterologous gene expression. We have chosen to 
use three avian cells; two embryonic cells from chicken and duck, and a 
quail fibrosarcoma line. We chose to use the chicken and duck 
embryo cells for the following the reasons. First, these embryonic 
cells can easily be prepared from eggs, and they divide rapidly, 

10 undergoing many passages. Second, chicken and duck cells can be 
grown at large scale with relatively low costs. Third, some avian cells, 
such as those from chicken embryos have already been used for 
medical products. For example, Influenza virus has been cultured in 
chicken eggs for the production of vaccines. Finally, the culture 

15 conditions, including media and temperature, required by avian embryo 
cells are virtually identical to those of mammalian cells, suggesting that 
the physiology of avian and mammalian cells is probably comparable. 
Further, the reason of choosing a QT cell line is that various 
transformed cell lines have been already developed, and it is easy to 

20 handle these cell lines to construct a pennanent cell line expressing a 
heterologous protein, and the culture conditions and media is similar to 
those of mammalian cells. 

I. Cells and Plasmids 

1. Cells 

25 The following Table 1 shows cells used in the experiment. 
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Table 1 



Ceils 


Source 


HeLa human cervical carcinoma cells 


ATCC CCL2 


Vero African green monkey kidney cells 


ATCC CCL81 


COS-7 African green monkey kidney cells 
transformed by wild-type T antigen of 
SV40 


ATCC CRL1651 


CH0-K1 Chinese hamster ovary cells 


ATCC CCL61 


NIH3T3 contacted-inhibited Swiss mouse 
embryo cells 


ATCC CRL1651 


Ad-5 transformed human embryonic 
kidney cells 293 


ATCC CRL1661 


SL-29 chicken embryo fibroblast cells 


ATCC CRL1590 


Duck embryo 


ATCC CCL141 
or prepared by the 
Inventors 


Quail fibrosarcoma line QT6 


ATCC CRL1708 


Quail fibrosarcoma line QT-VC 


Isolated by the inventors 
KCTC 0277BP 



All these cells except QT cell lines were grown in Dulbecco's 
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine 
semm (FBS). QT cell lines were cultured In Ml 99 medium instead of 
DEME. Duck embryo was either obtained from ATCC CCL 141 or 
prepared by trypsinization of 10- to 13- day old decapitated duck 
embryos. These avian cells were grown in minimum essential medium 
(Eagle) supplemented with non-essential amino acids and Earie's 
balanced salt solution containing 10 % FBS.' These cells could be 
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maintained for approximately another 30 passages. Each medium 
used In this study was supplemented with 120 ng/ml penicillin G 
(Sigma P-3032; 1690 units per mg) and 200 fxg/ml streptomycin (Sigma 
S-9137; 750 units per mg). 

5 2. Plasmids 

To evaluate the efficiency of heterologous protein production in 
avian cells, pRc/RSV-CAT and pRc/CMV-CAT were constructed by 
inserting a Hindlll CAT cassette (Pharmacia, Piscataway, NJ) into the 
Hindlll sites of pRc/RSV and pRc/CMV (Invitrogen, San Diego, 

10 California, USA), respectively. For pSVCAT, the plasmid p9l8 was 
used, which has been already described by the inventors. For EPO 
expression vectors, three vectors were used. pCMV-gEPO was 
constructed by cloning the Hindlll fragments of the EPO genomic 
sequence Into the Hindll I site of pRc/CMV. pSV-gEPO was derived by 

15 replacing the CAT sequence of pSV918 with the genomic EPO 
sequence. pIGA-EPO has cDNA of EPO controlled by HCMV MIEP 
and the genes of NEO and glutamine synthetase (hereafter "GS"). To 
measure the transfection efficiency, the plasmid pCMV-lacZ was 
constructed by inserting bacterial lacZ fragment into the Hindlll site of 

20 pRc/CMV. 

11. DNA Transfection and Gene Expression Assays 

The inventore tested whether avian embryo cells could be used 
for high levels of heterologous gene expression instead of mammalian 
cells. Although avian embryo cells have been used to culture viruses, 
25 there was no report that heterologous proteins of higher eukaryotic cells 
were expressed in these ceils. To canry out the study, it is necessary 
to develop the method of efficient transfection to avian cells. That is, 
to express heterologous genes in avian cells, it is required to develop 
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the transfection technique of DNA to target cells. At present, we could 
not find any reports on DNA transfection of avian embryo cells. 
Accordingly, the inventors have developed the technique that CEF and 
DE cells can readily be transfected with DNA. 

5 Among the techniques available, we have chosen a method 

using calcium phosphate coprecipitation, because this works well for 
various adherent cells and can also be used for establishing permanent 
lines. We have tested many different conditions and found that the 
following procedure was optimum. 

10 When cultures were 50-70% confluent in a 1 00 mm culture dish, 

a total of 10 ng DNA in HBS buffer (140 mM NaCI, 5 mM KCI, 0.75 mM 
Na2HP04.2H20, 6 mM dextrose, 25 mM HEPES) was incubated with the 
cells for 30 min at room temperature. 10 ml of regular media 
containing FBS was added and incubated for 20 hrs at 37 "C, except for 

15 CH0-K1 (8 hours). Cells were then treated with 10 ml of 100 \iM 
chloroquine, and incubated for another 3 hours at 37 °C. After 
replacement with 10 mi of fresh media, the cells were grown for 1 to 2 
days. Culture supematants were collected and centrifuged at 1000 
rpm for 10 min to remove cells and debris. To measure transfection 

20 efRciency, cells were transfected with pCMV-lacZ, rinsed once with 
PBS 3 days after transfection, fixed with 0.5 % glutaraldehyde (in PBS) 
for 10 min, and washed twice for 2-10 min each with 4 ml PBS 
containing 1 mM MgClz. For X-gal staining, the staining solution [PBS 
containing 4 mM K3Fe(CN)6, 4 mM K4Fe(CN)6.3H20, 2 mM MgCb, and 

25 400 [ig per ml X-gal (in dimethylformamide)] was added to fixed cells, 
and incubated at 37 °C for 4 hours overnight. When the reaction was 
completed, cells were washed once with PBS. Stained cells were kept 
in PBS. 
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CAT assay was carried out as follows: 

Two to three days after transfection, cells were harvested, 
washed once with PBS, and resuspended in 0.25 M Tris-HCl (pH 7.5). 
Total proteins were prepared by 4 cycles of freeze/thawing followed by 
5 heating at 65 "C for 7 min. Equivalent amounts of protein were 
assayed for CAT activity at 37 "C for 30 min. The amount of protein 
and the reaction time varied, depending on the experiments. For 
example, the CAT activity of cell extracts prepared from DE cells was 
so high that only 10 jig protein and 20 to 30 min reaction time had to be 

10 used, and under this condition, levels of CAT activity in other 
mammalian cells were very low or undetectable. When CAT activity 
became detectable in other cells, virtually all ^*C-chloramphenicol was 
converted. The percent conversion of ^*C-chloramphenicol to its 
acetylated forms was determined by cutting out regions containing 

15 unreacted and acetylated fornis and quantifying the amount of 
radioactivity in each by liquid scintillation counting. 

III. Gene Expression in DE and CEF 

Gene expression efficiency of DE and CEF was measured using 
CAT gene. We initially chose to use a promoter from the major 

20 immediate-early region of HCMV, because this has been shown to 
drive a high level gene expression in many different cell types. In the 
plasmid pCMV-CAT, the bacterial CAT gene is placed under the control 
of the HCMV MIEP. As a negative control, the plasmid Rc/CMV 
containing the promoter but no CAT sequence was used. These 

25 plasmids were transfected into DE and CEF cells and CAT activity was 
measured to estimate the efficiency of transfection and gene 
expression. One representative result from several independent 
transfections is shown in Fig. 1. Transfection of a control plasmid 
resulted in undetectable levels of CAT activity in both cells. However, 
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transfection with pCMV-CAT resulted in readily detectable levels of 
CAT activity in both cells. In more than five independent transfection 
assays, the level of CAT activity was always higher in DE cells than in 
CEF ceils. The magnitude of difference in the level of CAT activity 
5 between the two cells ranged from 10- to 50-fold, depending on the 
experiment. This result indicated that avian cells were readily 
transfected with DNA and the heterologous genes could be efficiently 
expressed. 

IV. Comparison of Levels of Gene Expression between Avian 
10 and Mammalian Cells, and between Different Promoters 

We have compared the levels of gene expression between avian 
and mammalian cells, using three different promoters; 

(1) the SV40 early promoter, which is used during the eariy 
transcriptional phase of SV40 infection; 

5 (2) the HCMV MIEP, which drives the expression of IE! and IE2 

regulatory proteins, immediately after HCMV infection; 

(3) the RSV LTR from an avian retrovirus. 

These promoters are knovim to be powerful in mammalian cells, 
and have often been used for high level heterologous gene expression. 

20 These promoter-CAT fusion constructe were transfected into 

four different cell lines, DE, CEF, CH0-K1, and HeLa, and CAT activity 
measured to compare the efficiency of gene expression between 
promoters and between cell types. To make this comparison 
semi-quantitative, all transfections and CAT assays were performed at 

25 the same time and using identical conditions. One representative 
result of such experiments Is shown in Fig. 2. Here, 10 jig of cell 
extracts were incubated for 30 min in the CAT reaction. Under these 
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particular conditions, the levels of CAT expression driven from the three 
promoters were very low in CHO and HeLa cells (Fig. 2). Only after 
larger amounts of proteins were used for extended reaction time, was 
CAT activity detected. In contrast, CAT activity was readily detectable 
5 in the avian cells (Fig. 2), except for the SV40 promoter in CEF cells. 
It indicated that the expression in avian cells are more effective than 
that in mammalian cells. 

The most dramatic finding was that the HCMV MIEP was 
extremely powerful In DE cells. In Fig. 2, the conditions used for the 

10 CAT reaction were chosen to generate the reasonable levels of CAT 
activity in other samples. When the CAT reaction was perfonned 
under limiting conditions for the protein sample prepared from DE cells 
transfected with pCMV-CAT (i.e., when the CAT conversion was below 
50 %), the levels of CAT activity of all the other samples were virtually 

15 undetectable. Therefore, the magnitude of difference in CAT activity 
between the protein sample from DE cells transfected with HCMV-CAT 
and those from the other transfections is at least two orders of 
magnitude. These results suggested that heterologous genes could 
be expressed very efficiently under the control of the HCMV MIEP in 

20 DE cells. 

It is possible that the high levels of CAT expression seen in DE 
cells could be due to efficient transfection of the cell population, rather 
than an ability of these cells to support strong gene expression. To 
distinguish these possibilities, we transfected pCMV-lacZ into DE and 
25 various animal cells. After transfection, cells were stained with X-gal, 
and the number of blue cells were counted to estimate the transfection 
efficiency. As shown in Fig. 3, the number of stained cell was always 
comparable between DE and other animal cells, suggesting that the 
high levels of CAT expression in DE cells were due to high levels of 
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10 



expression In individual cells. 

V. Cloning of human erythropoietin 

To test whether DE cells could indeed be used for the 
expression of medically important human proteins, we have isolated the 
genomic DNA encoding the human EPO gene. We chose to use EPO 
as a model because it is a secreted protein, so we could test whether 
DE cells properly process secreted proteins. We also used a genomic 
clone of EPO instead of the cDNA, to assess whether human genes are 
properly spliced to produce functional mRNAs in DE ceils. 

DNAs for cloning of EPO were prepared with blood cells 
collected from four people. Human peripheral blood lymphocytes 
were isolated by Ficoll-Hypaque gradient centrifugation of 
heparin-treated blood cells. Total DNA was prepared and used for 
polymerase chain reaction using specific oligonucleotide primers (Fig. 
15 4). The region around the start codon was highly GC rich, so the EPO 
sequence was cloned by two steps of PGR using two different pairs of 
primers. 

To obtain the genomic DNA for EPO, total DNA was prepared by 
lysing human peripheral blood lymphocytes using TES (10 mM Tris-HCI 
20 pH 7.8; 1 mM EDTA; 0.7 % SDS) followed by the treatment with 
400 jig/mi proteinase K at 50 °C for 1 hour, phenol:chlorofomi 
extraction, and ethanol precipitation. The polymerase chain reaction 
(PGR) was perfomied using 0.1 fig of total genomic DNA and 
oligonucleotide primers specific to the EPO gene. 

25 Primer #25 (sense. 5' to 3'): GAAGCTGATAAGCTGATAAGG 

Primer #33 (antisense, 5' to 3'): TGTGAGATGGTTAGATCTCA 

The samples were amplified through 30 cycles that included the 
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following parameters; denaturatlon at 92 °C for 1 min, primer annealing 
at 55 °C for Imin, and primer extension at 72 ''C for 1 min. The DNA 
fragment amplified from this reaction did not contain the first 13 
nucleotides in the N-terminal region, so a second PCR was perfonned 
5 using the following primers (Underlined, Hindlll; Outlined, start codon 
and stop codons, respectively). The relative position of these primers 
are as shown in Fig. 4. Taq DNA polymerase (POSCO Chem, Korea) 
and pfr polymerase (STRATGENE, California, USA) were used to 
amplify DNA. 

1 0 Primer #1 2 (sense, 5' to 3*): 

CAAGCTTCGGAGATGGGGTGCACGAATGTCCTGCCTGGCTGTGGC 

Primer #9 (antisense, 5' to 3'): 

CAAGCTTTCATCTGTCCCCTGTCCTGC 

The amplified DNA from the second PCR was cloned into the 
15 pCRII (Invitrogen), from which the Hindlll fragment containing the 
genomic sequence of EPO was inserted into various expression 
vectors as described above. In this experiment, the amplified DNA 
was placed under the control of the HCMV MIEP or SV40 early 
promoter, generating pCMV-gEPO and pSV-gEPO respectively. SY- 
20 EPO whose amino acid sequence is identical to that of the already 
known EPO is used for the expression experiments in the sections VII 
and VIII (See the section VI). 

VI. Analysis of Nucleotide Sequences of Cloned EPO Genomes 

Genomic structure of EPO cloned by the above method is different from 
25 the natural EPO genome in vivo. That Is, wild type EPO genomic DNA 
has five coding regions and four introns between them. However, in 
the DNA cloned by the above method, the first coding region was fused 
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to the second coding region to fonri one coding region so tliat it has 
four coding regions and three introns (Fig. 4). 

The results from the analysis of EPO gene sequences isolated from 
four people suggested that nucleotide sequences of EPO cloned from 
5 these region are significantly different from those of the prior two EPOs 
(AM-EPO and GI-EPO) (Fig. 5) at the sites of intron. Such a 
difference was not due to the error which occurred during DNA 
amplification in the process of cloning. We repeated cloning and 
sequencing using DNAs prepared from same individuals (but at 
10 different times) and obtained the same nucleotide sequence. As 

another control, we amplified the already cloned EPO under the similar 
conditions, and detennined the nucleotide sequence. Again, we 
obtained the same nucleotide sequence. 

Amino acid sequences of four EPO genes, together with AM and Gl, 
15 are shown in Fig. 6. Amino acid sequences fi^m AM, Gl and SY are 
identical. However, amino acid sequences from three people (JM, SH, 
HE) different by two or three different amino acids from Gl- and AM- 
EPO, suggesting that there is a polymorphisms among people. When 
compared with AM- or GI-EPO, HE-EPO had three different amino 
20 acids at C-terminal, SH-EPO three different amino acid over the whole 
polypeptide, and JM-EPO two different amino acids, one at C-terminal 
and the other in the middle of polypeptide (See Fig. 6). For example, 
while AM-EPO and GI-EPO had serine, alanine, and valine at positions 
36, 100 and 170 respechvely, SH-EPO had arginlne, serine, and 
25 tyrosine. Further, while AM-EPO and GI-EPO had valine, lysine, and 
allginine at positions 170, 177, and 191, HE-EPO had tyrosine, 
glutamine, and glycine. In JM-EPO, lysine and tyrosine were present 
at positions 54 and 170, while they were threonine and valine. These 
results suggested that the EPO gene has a polymorphism in amino 
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acids sequence as well as DNA sequence. 

VII. Expression of EPO in DE Cells 

In this experiment, we compared levels of EPO expression 
between DE cells and other cell lines. 

5 EPO expression vectors were transfected into various cells 

including DE, CEF. CHO, HeLa. VERO, and 293T. We have included 
VERO cells because they are often used for heterologous gene 
expression, and 293T cells which drive very high levels of gene 
expression, presumably due to both the high frequency of DNA 

10 transfection and the presence of potent viral transactivators such as 
EIA, EIB, and large T antigen. Two to three days after transfection, 
levels of EPO in the culture supernatants were measured by the 
enzyme linked immunoadsorbent assay, and transfection efficiencies 
were detemiined by staining cells adhered on the culture with X-gal. 

15 Transfection efficiency was can-ied out by transfection of a lacZ 
expression vector together with an EPO expression vector as described 
in the section II. One representative result of this analysis is 
summarized in Table 2. 



Table 2 



Cell 


HCMVMIEP 


SV40 eariy 
promoter 


HCMV/SV40 


293 


314 


17.5 


18 


CHO 


139.4 


10.4 


13.5 


VERO 


250 


10.7 


23.5 


NIH3T3 


89 


79.4 


1.1 


DE 


4335 


13.8 


314.8 



When the SV40 eariy promoter was used, there was little 
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dperence in the levels of EPO between cell types. However, when 
the HCMV MIEP was used, DE cells produced much higher levels of 
EPO than any other cell lines tested. The HCMV MIEP was much 
more active than the SV40 early promoter in almost all the cells tested. 

This difference was especially pronounced in DE cells, where the 
fomier produced 315 times more EPO than the latter. Among the 
various cell types, DE cells always produced the highest level of EPO. 

CHO cells are the source of cell lines producing EPO that is currently 
used for human application. In this transient system, however, the 
level of EPO in CHO cells was at least 30-fbld lower than in DE cells. 
Difference between DE and 293T cells was also considerable. 
Transfection efficiency of 293T was higher by about 30-folder than any 
other cells including DE cells. Moreover, 293T cells produce potent 
viral transcription transactivators. Nevertheless, DE produced 10- 
folder more EPO than 293T, suggesting that DE could drive high levels 
of the gene expression. 

In conclusion, human EPO could efficiently be produced and 
secreted in DE cells and that the HCMV MIEP is the promoter of choice 
for driving high level heterologous gene expression in DE cells. 

In summary, we found that DE cells could produce very high 
levels of bacterial and human proteins. All three promoters tested 
drove higher levels of gene expression in DE cells than any other cell 
lines used in this study. In particular, the HCMV MIEP was extremely 
powerful in DE cells. The high level of heterologous gene expression 
observed was not due to a higher number of transfected cells. It 
appears that DE cells properly process splicing and secretion because 
transfection of DE cells with an expression vector containing the EPO 
genomic DNA sequence produced a large quantity of EPO in the 
culture supematant. 
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For DE cells to be used for industrial purpose, one would need to 
develop large-scale culture techniques for these cells. There are two 
possible ways. First, it may be possible to use primary cells 
themselves as the producer line. A large number of DE cells can 
easily be prepared from 10- to 13 day-old duck embryos. From one 
embryo, we can readily obtain 1 0* to 1 0" cells that can undergo at least 
15 passages. Therefore, it is possible to transfect DE cells at the 
earliest possible stage with an expression vector followed by selection 
of transfected cells, which might require 4-7 passages. Even if less 
than 5% of the cells were transfected, a large number of transfected 
cells would be available, suggesting that large-scale culture of primary 
duck embryo cells is not impossible with primary cells. Second, it will 
be possible to transform duck embryo cells at an early stage, using one 
of the large number of well-characterized oncogenes that are available. 
With transfomied DE cells, a producer line could be constructed, and 
better quality control of protein production be established. It remains 
to be seen whether transformed DE ceils will still maintain the capability 
for high level gene expression. Although a number of biological 
questions remain to be answered, the potential of these cells for the 
production of various proteins warrants further investigation. 

VIII. Heterologous Gene Expression 
in the Transfomried Avian Ceil Line 

The above experiments demonstrated the great potential of DE 
cells as producers of heterologous proteins such as EPO. However, 
DE cells used in the above experiments are primary cells and stop 
dividing after 30-40 passages in vitro. Therefore, unless DE cells are 
transfomied or special techniques are developed as described above, it 
is difficult to use these embryonic cells for industrial production of 
heterologous proteins. 
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In the following study, we tested whether the transfomied avian 
cell line, namely the quail fibrosarcoma line, could be used to produce 
EPO. The quail fibrosarcoma line used in this study, QT-VC, was 
subcloned from QT6 (ATCC CRL1708). This line was derived from 
methyichoianthrene-induced fibrosarcoma of Japanese quail. QT-VC 
is different from its parental line in at least two aspects. First, QT-VC 
grows faster than the parental line in M199 medium containing 10% 
FBS used in this study. The fomrier divided every 12-24 hours, while 
the doubling time of the latter was 24-36 hours. Second, the QT-VC 
cell looks more roundish than QT6 which generally grows in a longish 
form. Like its parental line, QT-VC did not grow well when it was 
seeded at a low density. Therefore, ceils had to be split to 1/3 to 1/2 
after reaching confluence for continuous culture. 

1 . Analysis of Gene Expression in QT-VC Cells 

We compared the levels of gene expression between QT-VC 
and mammalian cells using pCMV-CAT. We chose to use the HCMV 
MIEP as this promoter was shown to drive high levels of gene 
expression in various cell types including avian cells (See the section 
IV). pCMV-CAT was transfected into 3 cell lines, QT-VC, CH0-K1 and 
Vero. To make this comparison semi-quantitative, all transfections 
and CAT assays were performed at the same time and using identical 
conditions. Transfection efficiency was also measured by 
cotransfecting pCM-lacZ followed by X-gal staining. The efficiency 
was approximately 3 % in all cases. Under these conditions, the 
levels of CAT expression in QT-VC cells were always 2-3 times higher 
than mammalian cell lines used in this study (Fig. 7). Although the 
level of gene expression in QT-VC cells appears to be lower than DE 
ceils, the quail fibrosarcoma line is at least as good as mammalian ceil 
lines, suggesting that it could be used as a producer for heterologous 
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proteins. 

2. Construction of EPO Expression Vectors for QT-VC Ceils 

To test wiiether fiigh levels of heterologous proteins could be 
expressed in QT cells, we have constructed other EPO expression 
5 vectors. The basic strategy for the construction of an expression 
vector was as follows: 

First, we chose to use the HCMV MIEP to drive expression of the 
heterologous gene as it had already been shown to be one of the 
strongest pronriotere in avian cells as well as mammalian cells. 

10 Second, the human glutamine synthetase (GS) gene was used 

for amplification of the target gene. Generally, the gene of interest is 
amplified to augment the yield of protein by using certain selectable 
markers in the presence of specific chemicals. One of the best 
examples is the dihydrofolate reductase (DHFR) gene. It has been 

15 shown that the copy number of the heterologous gene and the level of 
respective protein increase as the concentration of methotrexate (MTX) 
in the medium is slowly increased. However, this system requires the 
host cell defective in the gene DHFR, so cannot be directly applied to 
QT cells for which such a mutant line is not yet available. For this 

20 reason, we chose to use the GS gene. In this case, the host cell line 
need not to be deficient for GS, because only multiple copies of the GS 
gene can confer resistance to methionine sulfoximine (MSX). 

The overall structure of EPO expression vectors constructed for 
the use in QT cells is shown In Fig. 8. In this structure, the cDNA 
25 sequence for EPO is under the control of the HCMV MIEP, the bacterial 
Neo gene is used as the first selectable mariner, and the human GS 
gene is also present as the second selectable marker in the same 
plasmid. The backbone of expression vectors used in this particular 
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experiment was pCI-neo (Promega, USA) which uses the HCMV MIEP 
and the Intron from the |J-globin genome. We have made a couple of 
different constructs in which the GS gene is driven by the partial MMTV 
LTR (from -220 to +15) or the 220 bp HSV tl< promoters. In either 
5 case, the magnitude of gene amplification appears to be comparable 
(Data not shown). Detailed procedure and supplementary data 
regarding the construction of expression vectors is available upon 
request. 

3. Construction of QT-VC Cells Stably Expressing EPO 

10 To construct QT-VC cell lines constitutively expressing EPO, the 

cells were transfected with an EPO expression vector by a calcium 
phosphate coprecipitation method as described in the section II. 
Three days after transfection, EPO production was confinned by EILSA 
and transfected cells Were treated with G418 (0.8 mg/ml) and MSX (25 

15 ^iM). When G41 8-resistant cells were grown to confluence, cells were 
diluted for sublconing. Because QT-VC cells do not grow efficiently at 
a low cell density, cells were seeded on 1 0-cm culture dishes at various 
numbers (10^ 10\ 10^ 10^ per dish). Then the colonies that grew 
distant from other colonies were isolated by plastic O rings and 

20 expanded onto a 96-well plate. When cells reached 70 % confluence, 
the EPO level was measured. Subclones that produced more than 
200 U/ml were serially expanded from the 12-well to 6-well to 60 mm 
culture plates. When cells reached confluence on a 60 mm dish, cells 
were split on 6-well plates and then treated with various concentrations 

25 of MSX (100 ^M, 250 ^M, 1 mM). Using this procedure, several 
subclones that produced large amounts of EPO and also grew fast 
were selected. 

One of the subclones obtained through this procedure is QT- 
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N4D4. As shown in Fig. 9, this subclone produced 1200 U/ml when 
grown for 3 days after confluence. When the cells were split to 1:3, 
seeded on 10 cm dishes, and allowed to grow for another 3 days, N4D4 
still produced 1000 U/ml. The medium was then replaced with a fresh 
5 media containing 2 % FBS and the cells still produced 400 U/ml EPO. 
These results indicated that QT cells could produce a large quantity of 
EPO. 

In conclusion, the above experiment demonstrated the great 
potential of QT cells as a producer for heterologous protein. 

"•o IX. Biological Activity of EPO Produced in Avian Cells 

EPO is heavily glycosylated and such glycosylation is required 
for its biological activity. For example, EPO produced in E coli or 
yeast Is Inactive or very weakly active in vivo. To test whether EPO 
expressed in DE or QT cells was biologically active, we carried out an in 
15 vita bioassay using spleen cells isolated from mice treated with 
phenylhydrazine. 

EPO assay: Absolute levels of EPO production after 
transfection of various cells were determined by enzyme linked 
immunoadsorisent assay which is currently used to, measure EPO 

20 levels in the human serum (R&D Systems Inc., Minnesota, USA). To 
measure the biological activity of EPO, in vitro bioassay was carried out 
by the method of Krystal as modified by Goldberg et al Spleen ceils 
were taken from C57BL X C3H Fl hybrid mice (Seoul National 
University Laboratory Animal Center) on day 3 after the second of two 

25 daily injections of phenylhydrazine (60 mg/Kg of body weight per day) 
and spleen cell suspensions were prepared with Lymphoprep™ 
(NYCOMED PHARMA AS, Oslo, Nonway). The spleen cells (final 
concentration 4X10^ cells per ml) were then incubated in 24 well tissue 
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culture plates with various standard doses of EPO (CILAG AG 
International, Switzerland; specific activity 2000 U/ml) or unknown 
samples for 22 hr and then pulsed with 4 ^Ci/well tritiated thymidine 
(Amersham Co.) for 2-3 hr. The cells were harvested, washed with 
5 PBS several times and lysed by 0.3 N NaOH and 0.1% SDS. 
Radioactivity in LSC cocktail solutions were calculated by a Pharmacia 
Wallac 1410 scintillation counter. 

Culture supematants from QT-N4D4 cells or DE cells transfected 
with EPO expression vectors were taken to measure levels of EPO by 

10 both ELISA and the bioassay. The ELISA measures absolute 
concentration, and is currently used for detennining EPO concentration 
in human serum. On the other hand, the bioassay determines 
biological activity using a control EPO that has been produced from 
mammalian cells and is cuniently being used in humans. Fig. 10 

15 compares the difference in levels of EPO determined by these two 
methods. The ratio between the values (mU) was I ± 0.15, and the 
specific activity of EPO produced from DE cells was estimated to 105 
U/fig. Therefore, the levels of EPO measured by ELISA were very 
comparable to those obtained by the bioassay. This result suggested 

20 that EPO produced from these avian cells had a similar biological 
activity to commercially available EPO. 
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What is claimed is: 

1 . A heterologous gene expression system comprising: 
a DNA encoding a heterologous protein; 

a vector for receiving the DNA; and 
5 an avian cell for harboring the vector. 

2. The expression system of claim 1 , wherein the heterologous 
protein Is selected from the group consisting of TPA, Factor VIII and 
EPO. 

3. The expression system of claim 1, wherein the vector 
10 contains a promoter selected from the group consisting of SV early 

promoter, HCMV MIEP and RSV LTR. 

4. The expression system of claim 1, wherein the avian cell is 
selected from the group consisting of DE, CEF and QT. 

5. The expression system of daim 4, wherein the QT is QT-VC. 

15 6. The expression system of claim 1, wherein the DNA 

encoding the heterologous protein is DNA or cDNA. 

7. An avian cell as a host for expressing a gene encoding a 
mammalian heterologous protein. 

8. A method of producing a heterologous protein comprising 
20 the steps of; 

culturing the avian cell containing the expression system of claim 
1 to express the gene of the heterologous protein in media; and 

purifying the heterologous protein from the cell and the media. 

9. The method of claim 8, wherein the heterologous protein is 
25 selected from the group consisting of TPA, Factor VIII and EPO. 
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10. The method of claim 8, wherein the vector contains a 
promoter selected from the group consisting of SV early promoter, 
HCMV IVIIEP and RSV LTR. 

11. The method of claim 8, wherein the avian cell is selected 
5 from the group consisting of DE, CEF and QT. 

12. The method of claim 1 1 , wherein the QT is QT-VC. 

13. The method of claim 8, wherein the DNA encoding the 
heterologous protein is DNA or cDNA. 

14. An EPO production system comprising: 
10 a DNA encoding EPO; 

a vector for receiving the DNA; and 
an avian ceil for harboing the vector. 

1 5. The EPO production system of claim 14, wherein the avian 
cell is DE or QT. 

16. The EPO production system of claim 1 5, wherein the QT is 
QT-VC. 

17. The EPO production system of claim 14, wherein the DNA 
is a genomic DNA encoding EPO. 

18. The EPO production system of claim 14, wherein the DNA 
20 encoding EPO is selected from the group consisting of SY, JM, SH and 

HE described in Fig. 5. 

19. The production system of claim 14, wherein the vector 
contains a promoter selected from the group consisting of SV early 
promoter, HCMV MIEP and RSV LTR. 

25 20. A method of producing EPO comprising the steps of. 
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inserting a DNA encoding an EPO into a vector; 
transfecting tlie vector Into an avian cell; and 
culturing the transfected avian cell in media. 
21 . The method of claim 20, wherein the avian cell is DE or QT. 
5 22. Them method of claim 21 , wherein the QT is QT-VC. 

23. The method of claim 20, wherein the DNA encoding EPO is 
a genomic DNA. 

24. The method of claim 20, wherein the DNA encoding the 
EPO is selected from the group consisting of SY, JM, SH and HE 

10 described in Fig. 5. 

25. The method of claim 20, wherein the vector contains a 
promoter selected from the group consisting of SV40 early promoter, 
RSV LTR and HCMV MIEP. 

26. An EPO genomic sequence selected from the group 
15 consisting of SY, JM, SH and HE described in Fig. 5. 

27. An EPO amino acid sequence selected from the group 
consisting of JM, SH and HE described in Fig. 6. 
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FIG.5A 



ATG66GGTGCACGAAT6TCCTGCCTG6CT6T6GCTTCTCCTGTCCCTGCT 50 

ATGGGGGTGCACGAATGTCCTGCCTGGCTGTG6CTTCTCCTGTCCCTGCT 50 

ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCnCTCCTGTCCCTGCT 50 

ATGGGGGTGCACGAATGTCCT6CCTGGCTGTGGCTTCTCCTGTCCCT6CT 50 

ATCGGGGTGCACGAATGTCCTGCCTG6CTGTGGCTTCTCCTGTCCCT6CT 50 

ATGGGGGTGCACGAATGTCCTGCCTGGCTOTG6CTTCTCCTGTCCCTGCT 50 

GTCGCTCCCTCTGGGCCTCCCAGTCCTG6GCGCCCCACCACGCCTCATCT 100 

GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100 

GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100 

GTCGCTCCCTCT6G6CCTCCCAGTCCTGGGC6CCCCACCAC6CCTCATCT 100 

GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCAC6CCTCATCT 100 

GTCGCTCCCTCTGGGCCTCCCAGTCCTGGGCGCCCCACCACGCCTCATCT 100 



GTGACAG.q:GAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 




GTGACAGCbGAGTCCTGGAGAGGTACCTCTTGGAGGCCAAGGAGGCCGAG 150 



GTGACAG 
GTGACAG 

n 7\ A K A 7C 



,GAGTCCTGGAGAGGTACCTCTT6GAGGCCAAGGAGGCCGAG 150 
GTGACAG[:f:GA6TCCT6GAGA6GTACCTCTTGGAGGCCAAGGAGGCCGAG 150 



AATATCACGGTGAGACCCCnCCCCAGCACATTCCACAGAACTCACGCTC 200 

AATATCACGGTGAGACCCCnCCCCAGCACATTCCACAGAACTCACGCTC 200 

AATATCACGGTGAGACCCCnCCCCAGCACAnCCACAGAACTCACGCTC 200 

AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACAGAACTCACGCTC 200 

AATATCACGGTGAGACCCCTTCCCCAGCACATTCCACA6AACTCAC6CTC 200 
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FIG.5B 
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HE 



AGGGCTTCAGGG 

AGGGCTTCA66G 
AGGGCTTCAGGG 
AGGGCTTCAGGG 
AGGGCTTCAGGG' 
AGGGCTTi 
************ 



•WCCTCCCAG 
^ACTCCTCCCAG 
WTCCTCCCAG 
AACTCCTCCCAG 
ftACTCCTCCGAG 
CAGG^G /V\CTCCTCCCAG SKtCCAGGAACCTGGCACTTGG' 



J************ 



ATCCAGGAACCTGGCACnGGrrr 248 

'ITCCAGGAACCTGGCACTTGGTTT 248 

ATCCAGGAACCTGGCACTTGGTTT 248 

'VTCCAGGAACCTGGCACTTGGTTT 248 

^TCCAGGAACCTGGCACTTGGTTT 248 

ITTT 250 



*-'-*■ » 1 ■ ■ 



AM 
GI 
SY 
JM 
SH 
HE 



AM 
GI 
SY 
JM 
SH 
HE 



AM 
GI 
SY 
JM 
SH 
HE 



GGGGTGGAGTTGGGAAGCTA6ACACTGCCCCCCTACATAAGAATAAGTCT 298 

GGGGTGGAGnGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298 

GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298 

GGGGTGGA6TTG66AAGCTA6ACACT6CCCCCCTACATAAGAATAAGTCT 298 

GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 298 

GGGGTGGAGTTGGGAAGCTAGACACTGCCCCCCTACATAAGAATAAGTCT 300 
****************»*»**»»*»****»^^irt^^^^^^ ^ J, ^ 

6GTGGCCCCAAACCATACCTGGAAACTA6GCAAGGA6CAAAGCCAGCAGA 348 

GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCA6CAGA 348 

6GTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348 

G6T6GCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 348 

GGTGGCCCCAAACCATACCTGGAAACTAGGCAAG6AGCAAAGCCAGCAGA 348 

GGTGGCCCCAAACCATACCTGGAAACTAGGCAAGGAGCAAAGCCAGCAGA 350 

**** AAA ********* A A * ********************* A' AAAAAjfc*** 



[gcctgtggOccagggHg 



iGCCTGTGQGCCAGGG 
BCCT6T6GECCAGGG 



CCAGGQCCAG 



TCCTAi 

TCCTAd-lGCCTGTGd- 

tcctaciG|Gcctgtggg|ccagggHc4a- , 

TCCTAC 
TCCTAC 
TCCTAC : 

*****:»j_l********LI******n** 



/SGCCTTCAGGGACCCTTGACTCC 397 

ATCTTCAGGGACCCTTGACTCC 395 

MCCTTCAGGGACCCTTGACTCC 397 

|GCCTGTGddCCAGGGbc$dAycCTTCAGGGACCCTTGACTCC 398 



cA-a 

CA 



GAfiCCTTCAGGGACCCTTGACTCC 397 
GASCCTTCA6G6ACCCTTGACTCC 399 



' ' ' i.ii ii I I t 
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CCGGGCTGT3rGCATT|f 

CCGGGCTGT3rGCAnr 
CCGGGCTGTirGCATTr 



CC6GGCTGT3 
CCGGGCTGTSrGCATTfc 



********* 



>TTr 

fTGCATTr 



:AGAC|GGGCTGTGCT6AACACTGCAGCTTGAAT 447 

:AGAfc3G6CTGTGCTGAACACTGCAGCTrGAAT 445 

:AGAC SGGCTGTGCTGAACACTGCAGCnGAAT 447 

:AGAAGGGCTGTGCT6AACACTGCAGCTTGAAT 448 

:aGA :GGGCTGTGCTGAACACT6CAGCnGAAT 447 



AGA:GGGCTGTGCTGAACACTGCAGCTT6AAT 449 
j****U*******.********************* 



GA|WTATCACTGTCCCAGACACCAAAGTTAAnTCTAT6CCTGGAAGAG 497 

GA^MTATCACTGTCCCAGACACCAAAGnAATTTCTATGCCTGGAAGAG 495 

GA i\ WATCACTGTCCCAGACACCAAAGnAATTTCTATGCCTGGAAGAG 497 

GA 3 WATCACTGTCCCAGACACCAAAGTTAATTTCTATGCCTGGAAGAG 498 

GTOTATCACTGTCCCAGACACCAAAGTTAATTTCTATGCCTGGAAGAG 497 

GA 3 WATCACTGTCCCAGACACCAAAGTTAATTTCTATGCCTGGAAGAG 499 

yic************************vbHt*****************^ 



** 



GATGGAGGTGAGTTCCT 
GATGGAGGTGAGnCCT 
GATGGAGGTGAGTTCCT 
GATGGAGGTGAGnCCT 
GATGGAGGTGAGTTCCT 
GATGGAGGTGAGTTCCT 



nTCCTTTCTTTTGGAGAATCT 547 

ntrCCTTTCTTTTGGAGAATCT 545 

rnrCCTTTCTTTTGGAGAATCT 547 

nrCCTTTCTTTTGGAGAATCT 548 

--irCCTTTCTTTTGGAGAATCT 545 

njrCCTTTCTTTTGGAGAATCT 549 



AM 
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JM 
SH 
HE 



,TTTT 



CATTTGCGAGCCTGA" 
CATTTGCGAGCCTGAT 
CATTTGCGAGCCTGA" 

CATTTGCGAGCCTGA' 
CATTTGCGAGCCTGA" 
CATTTGCGAGCCTGATTT 

-> — i„i-.i-.t..t 1 ■ » » 



iGATGAAAGGGAGAlAlrGATCi 
iGATGAAAGGGAGAhfrGATCi 
3GATGAAAGGGAGAA|TGATcdfl|GGGAAAGGT 
BGATGAAAGGGAGA|3|rGAT0 ' 
3GATGAAAGGGAGAWGATC' 
3GATGAAAGGGAGA'\rGATCL 

»r*************U(r*****[_ 



^GGAAAGGT 597 
IGGAAAGGT 595 
597 

iGGAAAGGT 598 

GGAAAGGT 595 
IGGAAAGGT 599 
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AM 
GI 
SY 
JM 
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HE 



AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCKpTCTA 647 

AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTC ^dGTCTA 645 

AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCAdGTCTA 647 

AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCAcEtCTA 648 

AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTckctBTCTA 645 

AAAATGGAGCAGCAGAGATGAGGCTGCCTGGGCGCAGAGGCTCCASTCTA 649 



TAATCCCAGGCTGAGAgSGCCG 



TAATCCCAGGCTGAGAljSGCCG, 



taatcccaggctgaga- 
taatcccaggctgaga:! 

taatcccaggctgaga 

**************** 



^TGGGAGAATTGCTTGAGCCCTGGAG 697 

^TGGGAGAATTGCTTGAGCCCTGGAG 695 

TAATCCCAGGCTGAGAjrpGCCGAjA ^VTGGGAGAAITGCTTGAGCCCTGGAG 697 

ATG6GAGAATTGCTTGAGCCCTGGAG 698 

iKtGGGAGAATTGCTTGAGCCCTGGAG 695 

iATGGGAGAATTGCTTGAGCCCTGGAG 699 



]GCCG/ 

lt*****l 



«c************************ 



AM 
GI 
SY 
JM 
SH 
HE 



GTTCAGACCAACCTAGGCAGCpfTAGTGAGATCCCCCATCTCTACAAACAT 747 

GTTCAGACCAACCTAGGCAGC f AGTGAGATCCCCCATCTCTACAAACAT 747 
GnCAGACCAACCTAGGCAGC f AGTGAGATCCCCCATCTCTACAAACAT 747 
GTTCAGACCAACCTAGGCAGCHrAGTGAGATCCCCCATCTCTACAAACAT 748 

745 

rAGTGAGATCCCCCATCTCTACAAACAT 749 



GTTCAGACCAACCTAGGCAGC : 
*********************! ******** 



GTTCAGACCAACCTAGGCAGC ^ rAGTGAGATCCCCCATCTCTACAAACAT 



AM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 797 

GI TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATAH 795 

SY TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATAH 797 

JM TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATAH 798 

SH TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATAH 795 

HE TTAAAAAAATTAGTCAGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATT 799 

******** A. A * It ************ A A A * * ***************»»»»»» 
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AM 
GI 
SY 
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AM 
GI 
SY 
JM 
SH 
HE 



AM 
GI 
SY 
JM 
SH 
HE 



T6GteTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTGj5l3GCTGC(^ 
TGG^S ft 5GCTGAGGC6GGAGGATC6CTT6AGCCCAGGAATTTG A SGCTGC ^ 
TGGMBGCTGAGGCGGGAGGATCGCTTGAGCCCAGGAAnTGASGCTGC^ 
TGG/iAmGAGGCGGGAGGATCGCTT6AGCCCAGGAATTTGA3GCTGC3 
TGG/Sft3GCTGAGGCGGGA6GATC6CTTGAGCCCAGGAATTTG33GCTGCA 
TGG/ijAfiGCTGAGGCGGGAGGATCGCTTGAGCCCAGGAATTTG A 3GCTG0A 



**** 



lc* A " A A A A A' A A * ******** 



fc*****L 



gtgagctgtgatcacaccactgcaorccagcctcagtgacagastgaggc 
gtgagctgtgatcacaccactgc/s c rccagcctcagtgacaga 3 tga6gc 
gtgagctgtgatcacaccactgc/^gtccagcctcagtqacagaRtgaggc 
gtgagctgtgatcacaccactgcactrccagcctcagtgacagabftgaggc 
gtga6ctgtgatcacaccactgcaftrcca6cctcagtgacaga3rgaggc 
gtgagctgt6atcacaccactgca ctccagcctcagtgacaga } fgaggc 

************************Ulr***************** ****** 



CCTGTCTC 

CCTGTCTC, 
CCTGTCTC 
CCTGTCTC 
CCTGTCTC 
CCTGTCTC 

iiiiiiiii iiiiiii ii it I 1 t t 1 1 J 




********* 



ATACgrrCATTAnCATTCACTCACTCACTCACTCAT]i]CATTCAnCAn 
ATACaTTCATTATTCATTCACTCACTCACTCACTCATirCATTCATTCAn 
ATAC CrCATTAnCATTCACTCACTCACTCACTCAT C lAHCATTCATT 
ATAqftirrCATTATTCATTCACTCACTCACTCACTCATfr 
ATAClAjlTCATTATrCATTCACTCACTCACTCACTCA 
ATACAiTTCATTAnCATTCACTCACTCACTCACTCA' 



,t|t|c 
■*l>' 



DATTCATTCATT 

ATTCATTCATT 

iATTCATTCATT 
*********** 



847 
845 
847 
848 
845 
849 



897 
895 
897 
898 
895 
899 



.GAAAAAAGAAAAATPATGAGGGCTGTATGGA 947 

lAAAAGAAAAAAGAAAAATflkTGAGGGCTGTATGGA 945 

iAAAAGAAAAAAGAAAAATAATGAGGGCTGTATGGA 947 

iAAAAGAAAAAAGAAAAATAATGAGGGCTGTATGGA 948 

BAAAAGAAAAAAGAAAAATaKtGAGGGCTGTATGGA 945 

lAAAAGAAAAAAGAAAAAljTiATGAGGGCTGTATGGA 949 

«:******************|j)t 



997 

995 
997 
998 
995 
999 
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FIG.5F 
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AM 
GI 
SY 
JM 
SH 
HE 



AM 
GI 
SY 
JM 
SH 
HE 



CATTCAAC, 
CATTCAAC 
CATTCAAC 
CAHCAAC 
CATTCAAC 
CAHCAAC 



******: 




TCTTATTGCATACCnCTGTTTGCTCAGCTTGGTGCTflfe 
STCnATTGCATACCTTCTGTrTGCTCAGCTTGGTGCTTG 
TCnATrGCATACCnCTGTTTGCTCAGCTTGGTGCTf 
iTCTTATrGCATACCTTCTGTTTGCTCAGCTTGGTGCTrB 
TCTTATTGCATACCTTCTGTnGCTCAGCTTGGTGCT T 3 
iTCnATTGCATACCTrCTGTrTGCTCAGCTTGGTGCT 1 3 

k-k***ifk ******* Xi.i.kk kie-Hc***** AAAAAAAA** Jt 




TGAGGGGCAGGAGGGpjSAGGGTGACATgg^CAEnTGACTCCC 
TGAGGGGCAGGAGGGAGAGGGTGACATCCCTCAbbTGACTCCC 
TGAGGGGCAGGAGGGhpAGGGTGACATGGGTCAGCTGACTCCC 
:TGAGGGGCAGGAGG(#GAGGGTGACATCGdrCAbpTGACTCCC 
ITGAGGGGCAGGAGGGNSAGGGTGACATCGGfTCAGCTGACTCCC 
:TGAGGGGCA6GA6GG/^A6GGT6ACATlSGG|TCAKbTGACTCCC 

iJfcAAAAAAAAAAAA * *4iHr*********i U**! 



1047 

1045 
1047 
1048 
1045 
1049 



1097 
1095 
1097 
1098 
1095 
1099 



AGAGTCCACTCCCTGTj^GTCGGGCAECAGGCCGTAGAAGTCTGGCAGGG 1147 

AGAGTCCACTCCCTGWTCGGGCAlsbAGGCCGTAGAAGTCTGGCAGGG 1145 

AGAGTCCACTCCCTGT OTCGGGCAl^pAGGCCGTAGAAGTCTGGCAGGG 1147 

TTpGTCGGGCAStAGGCCGTAGAAGTCTGGCAGGG 1148 

■ '"GTCGGGCApCAGGCCGTAGAAGTCTGGCAGGG 1145 

. 16TCGGGCAGCAGGCCGTAGAAGTCT6GCAGGG 1149 



AGAGTCCACTCCCTG 

AGAGTCCACTCCCTGT 

AGAGTCCACTCCCTGT 



CCTGGCCCTGCTGTCGGAAppTGTCCTGCGGGGCCAGGCCCTGHGGTCA 1197 

CCTGGCCCTGCTGTCGGAAiSCTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1195 

CCTGGCCCTGCTGTCGGAAGDTGTCCTGCGGGGCCAGGCCCTGnGGTCA 1197 

CCTGGCCCTGCTGTCGGAAjsbTGTCCTGCGGGGCCAGGCCCTGTTGGTCA 1198 

CCTGGCCCTGCTGTCGGAATCTGTCCTGCGGGGCCAGGCCCTGHGGTCA 1195 

CCTGGCCCTGCTGTCGGAmGTCCTGCGGGGCCAGGCCCTGnGGTCA 1199 

*"*'*"""*"*""■ ■- ■ « ' It . t. * 
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FIG.5G 
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ACT TCCCAaCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1247 

ACT ctrrCCCA 3 XGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1245 

ACT TTTCCCA 3 XGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1247 

ACTUtTCCCA SpCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1248 

ACTIQTTCCCAICCGTGGGAGCCCCTGCAGCTGCATGTGGATAAAGCCGTC 1245 

C6TGGGA6CCCCTGCAGCTGCATGT6GATAAAGCCGTC 1249 



ACTcrrcccA] 
*** I****** 



AM agtggccttcgcagcctcaccactctgcttcgggctctgggagcccag 

GI agtggccttcgcagcctcaccactctgcttcgggctctgggagcccai 

SY AGTGGCCTTCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAGbir 

JM AGTGGCCnCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAG il 

SH AGTGGCCnCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAG 

HE AGTGGCCnCGCAGCCTCACCACTCTGCTTCGGGCTCTGGGAGCCCAG 
*********************** 




AM 
GI 
SY 
JM 
SH 
HE 



GAGTAGGAG I 3GACACTTCTGCTTGCCC" 
GAGTAGGAG : SGACACTTCTGCTTGCCCTn 
GAGTAGGAG : SGACACnCTGCTTGCCCTTT 
GAGTAGGAG : 3GACAC1TCT6CTTGCCC 

GAGTAGGAG 3GGACACTTCTGCTTGCCC 



GAGTAGGAG 
********* 



tiGACACTTCTGCTTGCCCTTTl 



"GTAAGAAGGia, 
TGTAAGAAGG 
:rGTAAGAA6G 
jTGTAAGAAGG 



: TGTAAGAAGG 



******1J********** 



lAGAAGG 1347 

i^AGAAGG 1345 

lAGAAGG 1347 

iAGAAGG 1348 

FGTAAGAAGGWSAGAAGG 1345 

3AGAAGG 1349 
******* 



J 3i 



AM GTCTTGCTAAGGAGTACAG 

61 GTCTTGCTAAG6AGTACAGG/> 

SY GTCHGCTAAGGAGTACAGG 

JM GTCnGCTAAGGAGTACAG 

SH GTCTTGCTAAGGAGTAC/s 

HE GTCTTGCTAAGGAGTACAG 

iiiiti I ii, 1 1 1 ■■ '- ■«■ ■' 



TGTCCGTATTCCTTCCCTTTCTGTGGC 1397 

TGTCCGTATTCCTTCCCTTTCTGTGGC 1395 

:TGTCCGTATTCCTTCCCTTTCTGTGGC 1397 

:TGTCCGTATTCCnCCCTTTCTGTGGC 1398 

:TGTCCGTATTCCTTCCCTTTCTGTGGC 1395 

TGTCCGTATTCCnCCCTTTCTGTGGC 1399 
**************************** 
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FIG.5H 
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AM 
GI 
SY 
JM 
SH 
HE 



ACTGCAGCGACCfifcCTG' 

ACTGCAGCGACCltcTg 
ACTGCAGCGACCILCTG 

actgcagcgacotIcctg 

ACTGCAGCGACCfT 

ACTGCAGCGACdACCTG 



ITTTTCTCCnGGCAGAAGGAAGCCATCTCCCCT 1447 
CTCCnGGCAGAAGGAAGCCATCTCCCCT 1445 

■CTCCTTGGCAGAAGGAAGCCATCTCCCCT 1447 



inTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1448 
:CTGTTTTCTCCTTGGCAGAAGGAAGCCATCTCCCCT 1445 
■" ^mrCTCCnGGCAGAAGGAAGCCATCTCCCCT 1449 



CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGApKCTTT 1497 

CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGA icTTT 1495 

CCAGATGCGGCCTCAGCT6CTCCACTCCGAACAATCACTGCTGA :aCTTT 1497 

CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGA 3ACTTT 1498 

CCAGATGCGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGA T ^CTTT 1495 

CCA6AT6CGGCCTCAGCTGCTCCACTCCGAACAATCACTGCTGA : CTTT 1499 



CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGG/ 

CCGCAAACTCTrCCGAGTCTACTCCAATTTCCTCCGGGG/! 

CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGC 

CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGG 

CCGCAAACTCnCCGAGTCTACTCCAATTTCCTCCGGGG/ 

CCGCAAACTCTTCCGAGTCTACTCCAATTTCCTCCGGGC 

********************* AAAAAAA*^! 



^GCTGAAGC 


1547 


^GCTGAAGC 


1545 


iGCTGAAGC 


1547 


^GCTGAAGC 


1548 


^GCTGAAGC 


1545 


iGCTGAAGC 


1549 


********* 
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HE 



TGTACACAGGGGA6GCCTGCAGGACAGGGGAC13ATGA 
TGTACACAGGGGA6GCCTGCAGGACAGGGGAC '\ 3ATGA 
TGTACACAGGGGAGGCCTGCAGGACAGGGGAC ^ 3ATGA 
TGTACACAGGGGAGGCCTGCAGGACAG66GACf\GAT6A 
TGTACACAGGGGAGGCCTGCAGGACAGGGGACf\hATGA 
TGTACACAGGGGA6GCCTGCAGGACAGGGGAC I 

***** AAA t.***********irk* kKX ** **** 



1584 
1582 
1585 
1585 
1583 
1AT6A 1586 
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FIG.6 
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AM/GI 

SY 

JM 

SH 

HE 



AM/GI 

SY 

JM 

SH 

HE 



AM/GI 

SY 

JM 

SH 

HE 



MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICI 
MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICI 
MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICL^.........,„„^„^ 

MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRL ICDRRVLERYLLEAKEAE 50 
MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICokkvLERYLLEAKEAE 50 



:VLERYLLEAKEAE 50 
VLERYLLEAKEAE 50 

^VLERYLLEAKEAE 50 



**************** 



NITffl3CAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSE^ 

NITrpCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSEA 

NITOCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSE'\ 

NITJpCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSEi 

NIT ipCAEHCSLNENITVPDTKVNFYAWKRMEVGQQAVEVWQGLALLSE <\ 
***l>******************************************** . 



100 
100 
100 
100 
100 



VLR6QALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLniLRALGAQKEAISPPD 150 

VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

VLRGQALLVNSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAISPPD 150 

**************^r»* jr^r*************^b * * A A A A A A * ******** 



AASAAPLRTITADTFRKLFRVi/SNFLRG ( 
AASAAPLRTITADTFRKLFRV /§NFLRG ( 

AASAAPLRTITADTFRKLFRVrSNFLRG ( 



-KL'YTGEACRTGDp 


193 


-KLYTGEACRTGDR 


193 


-KLYTGEACRTGDk 


193 


-KLYTGEACRTGDR 


193 


-KLYTGEACRTGDG 


193 


r, A vrvCftKjtTCxxL 
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FIG.7 
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